self-supervised deep learning
- Europe > Germany > Brandenburg > Potsdam (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Self-Supervised Deep Learning on Point Clouds by Reconstructing Space
Point clouds provide a flexible and natural representation usable in countless applications such as robotics or self-driving cars. Recently, deep neural networks operating on raw point cloud data have shown promising results on supervised learning tasks such as object classification and semantic segmentation. While massive point cloud datasets can be captured using modern scanning technology, manually labelling such large 3D point clouds for supervised learning tasks is a cumbersome process. This necessitates methods that can learn from unlabelled data to significantly reduce the number of annotated samples needed in supervised learning. We propose a self-supervised learning task for deep learning on raw point cloud data in which a neural network is trained to reconstruct point clouds whose parts have been randomly rearranged. While solving this task, representations that capture semantic properties of the point cloud are learned. Our method is agnostic of network architecture and outperforms current unsupervised learning approaches in downstream object classification tasks. We show experimentally, that pre-training with our method before supervised training improves the performance of state-of-the-art models and significantly improves sample efficiency.
Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
Recent works in self-supervised learning have advanced the state-of-the-art by relying on the contrastive learning paradigm, which learns representations by pushing positive pairs, or similar examples from the same class, closer together while keeping negative pairs far apart. Despite the empirical successes, theoretical foundations are limited -- prior analyses assume conditional independence of the positive pairs given the same class label, but recent empirical applications use heavily correlated positive pairs (i.e., data augmentations of the same image).
- Europe > Germany > Brandenburg > Potsdam (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Reviews: Self-Supervised Deep Learning on Point Clouds by Reconstructing Space
Originality: This paper is a novel combination of an existing method [7,21] for 2D images, to an existing task (point cloud feature learning). Given the success of [21], one would expect it also works for 3D representation where the spatial layout is equally or more important, which is confirmed by the results in this paper. The citations in this paper sufficiently cover related work. Quality: Most of the experimental results appear to be meaningful and support claimed advantages of this method: architecture-agnostic, avoids reconstruction metric, helps supervised down-stream tasks. But the comparison to alternative methods in Table 1 is weakened by the fact that model architectures used by the baseline methods are not mentioned.
Reviews: Self-Supervised Deep Learning on Point Clouds by Reconstructing Space
The paper received weak but still positive support from reviewers. The main concern was limited novelty on transferring work into 2D computer vision to 3D computer vision. However, the simplicity of the approach is a strength, and the approach seems to work well, which the reviewers generally agree on.
Self-Supervised Deep Learning on Point Clouds by Reconstructing Space
Point clouds provide a flexible and natural representation usable in countless applications such as robotics or self-driving cars. Recently, deep neural networks operating on raw point cloud data have shown promising results on supervised learning tasks such as object classification and semantic segmentation. While massive point cloud datasets can be captured using modern scanning technology, manually labelling such large 3D point clouds for supervised learning tasks is a cumbersome process. This necessitates methods that can learn from unlabelled data to significantly reduce the number of annotated samples needed in supervised learning. We propose a self-supervised learning task for deep learning on raw point cloud data in which a neural network is trained to reconstruct point clouds whose parts have been randomly rearranged.
Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
Recent works in self-supervised learning have advanced the state-of-the-art by relying on the contrastive learning paradigm, which learns representations by pushing positive pairs, or similar examples from the same class, closer together while keeping negative pairs far apart. Despite the empirical successes, theoretical foundations are limited -- prior analyses assume conditional independence of the positive pairs given the same class label, but recent empirical applications use heavily correlated positive pairs (i.e., data augmentations of the same image). Edges in this graph connect augmentations of the same data, and ground-truth classes naturally form connected sub-graphs. We propose a loss that performs spectral decomposition on the population augmentation graph and can be succinctly written as a contrastive learning objective on neural net representations. Minimizing this objective leads to features with provable accuracy guarantees under linear probe evaluation.
Comments on 'Fast and scalable search of whole-slide images via self-supervised deep learning'
Sikaroudi, Milad, Afshari, Mehdi, Shafique, Abubakr, Kalra, Shivam, Tizhoosh, H. R.
Chen et al. [Chen2022] recently published the article "Fast and scalable search of whole-slide images via self-supervised deep learning" in Nature Biomedical Engineering. The authors call their method "self-supervised image search for histology", short SISH. The paper is not easily readable, and many important details are buried under ambiguous descriptions. Incremental modification of Yottixel - Yottixel introduced the concept of "mosaic" through a customized clustering and selection process [Kalra2020a]. While Chen et al. frequently mention "Yottixel" and "mosaic," they only acknowledge once that they have followed the Yottixel's mosaic generation process.
- North America > United States > Minnesota > Olmsted County > Rochester (0.05)
- North America > United States > District of Columbia > Washington (0.05)
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.05)
Live 4D-OCT denoising with self-supervised deep learning
By providing three-dimensional visualization of tissues and instruments at high resolution, live volumetric optical coherence tomography (4D-OCT) has the potential to revolutionize ophthalmic surgery. However, the necessary imaging speed is accompanied by increased noise levels. A high data rate and the requirement for minimal latency impose major limitations for real-time noise reduction. In this work, we propose a low complexity neural network for denoising, directly incorporated into the image reconstruction pipeline of a microscope-integrated 4D-OCT prototype with an A-scan rate of 1.2 MHz. For this purpose, we trained a blind-spot network on unpaired OCT images using a self-supervised learning approach. With an optimized U-Net, only a few milliseconds of additional latency were introduced. Simultaneously, these architectural adaptations improved the numerical denoising performance compared to the basic setup, outperforming non-local filtering algorithms. Layers and edges of anatomical structures in B-scans were better preserved than with Gaussian filtering despite comparable processing time. By comparing scenes with and without denoising employed, we show that neural networks can be used to improve visual appearance of volumetric renderings in real time. Enhancing the rendering quality is an important step for the clinical acceptance and translation of 4D-OCT as an intra-surgical guidance tool.